Universal approximation of multi-copy state 
and universal quantum lossless data compression 
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We have proven that there exists a quantum state approximating any muhi-copy state universaUy 
when we measure the error by the normalized relative entropy. While the qubit case was proven 
by Krattenthaler and Slater (IEEE Trans. IT 46, 801-819 (2000)), the general case has been open 
for more than ten years. For a deeper analysis, we have solved the mini-max problem concerning 
'this approximation error' up to the second order. Further, we have applied this result to quantum 
lossless date compression, and have constructed a universal quantum lossless data compression. 



I. INTRODUCTION 

Does there exist a quantum state approximating the 
n-copy state universally? Such a state does not seem to 
exist, however, if we measure the difference between two 
states by the normalized relative entropy, such a state 
exists. In the classical case, i.e., the non-commutative 
case, Clarke and Barron [J] shown that the mixture state 
of n-fold tensor product states approximates all of n-fold 
tensor product states. More precisely, they proven that 
the following. Consider the set of commutative desities 
{pe\9 e e C R"}e on H := C^, and a prior distribution 
fi on Q. They shown that the mixture state Pw,n ■= 
/e pew{d)dd satisfies the relation 
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y log ^ + 2 logdetVm-^ogw{0), 

(1) 



where I (9) is the Fisher information matrix (See 0). In 
this paper, we choose the logarithm base 2. In particu- 
lar, Clarke and Barron [3] proven that the mini-max of 
the constant term is given when the prior w is chosen as 

the Jeffreys' prior — Vdct /(e) — ^ replace the rel- 

f y/dct i(e'}de' ^ 
ative entropy by another measure satisfying the axiom 
of the distance, no state satisfies the relation That 
is, since the relative entropy does not satisfy the axiom 
of the distance, such a state can be exist. Concerning 
the quantum case, in 1996, Krattenthaler and Slater[4| 
shown its quantum extension in the qubit case. However, 
the general case has not been shown for more than ten 
years. Their paper has not proven the mini-max problem 
for the constant term completely. 

In this paper, we prove the existence of the states (T„ 
on Ti*^" satisfying that 



logn + 0(l) 



(2) 



for all of faithful states p on TC, i.e., the state p belongs to 
the set S {p\ rankp = d}. Since the dimension of the 



state family S is d^ — 1, the relation ^ can be regarded 
as a natural quantum extension of ([T]). More precisely, 
we calculate the following mini-max value 



min sup lim 



Dip^K) 



1 



logn 



(3) 
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which is the main result in this paper. Krattenthaler and 
Slater treated the same problem among a restricted 
class of states {cn} in the qubit case. 

The term (jS]) has the meaning of the approximation in 
the sense of information processing. Recently, Hayashi Q 
provided universal coding for classical-quantum channel. 
In his derivation, the approximating state (T„ is used in- 
stead of the true state p®" . The reason why this replace- 
ment well-works seems that this type information-sense 
approximation holds. 

In the classical case, this relation is considered as the 
asymptotic redundancy of the prefixed variable-length 
lossless data compression [l], Any distribution cor- 
responds to the prefixed variable-length data compres- 
sion through Kraft inequality, and the minimum aver- 
age length is the Shannon entropy When the applied 
code corresponds to the distribution (T^,„, the relation 
P]) expresses the asymptotic redundancy when the true 
state is p®". This fact means that good information- 
sense approximation implies a small redundancy in the 
prefixed variable-length lossless data compression. That 
is, the relation ([T|) guarantees the existence of universal 
variable-length lossless data compression in the classical 
case [Hi, 0,11. 

In the quantum case, it is known that there exists uni- 
versal fixed-length data compression^, [l3|. However, 
the quantum analogue of variable-length lossless data 
compression is not simple pH In order to determine the 
size of the memory storing the quantum information, we 
have to measure the length. This measurement demol- 
ishes the quantum state while the degree of the state 
demolition can be reduced to infinitesimal Hence, 
it is impossible to interpret the length of the quantum 
case in the same way. Therefore, we have to prepare 
the memory whose size is the maximum length of the 
given code in the framework of variable-length lossless 
data compression (Tsj . In order to avoid this problem, 
we regard the length of code as the energy and treat 
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the average energy because the energy is not necessar- 
ily to be measured. In this paper, we formulate quantum 
variable-length lossless data compression in this way, and 
explain that the relative entropy £'(/9®"||ct„) expresses 
the asymptotic redundancy. 

This paper is organized as follows. In section [TTl we 
prepare a basic knowledge of representation theory, and 
show ([2|). In section Uni using the relations given in sec- 
tion ini we calculate ([3]) . In section IIVI we numerically 
calculate ([3]) in the qubit case, and compare the result 
by Krattenthaler and Slater Q • In section |Vl we treat 
quantum variable-length lossless data compression, and 
apply our result to this topic. 



II. REPRESENTATION THEORY AND 
APPROXIMATION OF MULTI-COPY STATE 

In order to treat information-sense approximation, we 
focus on the dual representation on the n-fold tensor 
product space by the the special unitary group SU(d) 
and the n-th symmetric group S'„. (Christandl 14] con- 
tains a good survey of representation theory for quan- 
tum information.) For this purpose, we focus on Young 
diagram and type. When the vector of integers n — 
(ni,n2, . . . ,nd) satisfies the condition ni > n2 > ■ ■ ■ > 
TT-d ^ and X]f=i '^i ~ vector n is called the Young 

diagram (frame) with the size n and the depth d, their set 
is denoted by Y^f. The number of these sets is evaluated 
by 



\Y:!\<{n+l) 



(4) 



Since the sets {(ns(i))|n £ r,f} and {{ns'{i))\n G T^} are 
distinct for any s ^ s' € Sd, the relation |{''?| Jt.! — 
n} I = implies the asymptotic behavior of the car- 

dinality lY^'^l: 



1)! 



(5) 



Using the Young diagram, we can characterize the ir- 
reducible decomposition of the dual representation of 
SU(d) and Sn- 

where Uf{ is the irreducible representation space of SU((i) 
characterized by n, and Va is the irreducible representa- 
tion space of n-th symmetric group Sn characterized by 
ft. Here, we denote the representation of n-th symmet- 
ric group Sn hy V : Sn 3 s 1-^ Vg. According to Weyl's 
dimension formula, the dimension Ufi has the expression 



dimZifi = Y\_ 



< n 2 . (6) 



for any n S F,f, That is, for a given probability distri- 
bution p = (pi, . . . ,pd) on {1, . . . , d} with the condition 
Pi> P2> ■■■ > Pd, when (pi, . . . = • • • , if), 



-n 2 



(7) 



2rf-i3d-2...(rf_ 1)1 

Then, we denote the projection to the subspace Uf{ (g) Vfj 
by 7^, and define the following. 

_ 1 



(8) 



Any state p and any Young diagram n € satisfy that 

Thus, (HI), dSl), and dS]) yield the inequality 

(n + l)^^^at;,„>p«". (9) 
Since (Ju_n is commutative with p®^, we have 

(d+2)(d-l) „ 

log(n -I- 1) 5 -I- logau^n > logp^". 
Thus, we obtain 

Dip^^au^n) =Trp®"(logp«" - logau.n) 
.(d + 2)(d- 1) 



<- 



■log(n + l). 



Therefore, the state au,n universally approximates the 
state in the sense of the normalized quantum relative 
entropy: 

-D{p^^au,n) - 0. 
n 

Now, we calculate the value D{p'^"\\au,n) more pre- 
cisely. In the following calculation, we assume that 

d 

pip) = YP^V){'i'l 
i=l 

where p = {pi, . . . ,pd) is a probability distribution on 
{1, . . . ,d}, and is the orthonormal basis of Ti. 

In this calculation, it is essential to calculate the average 
of the random variable log \ Y^\ + log dimUfi + log dim Vh 
under the distribution Qp{n) :— Tr In order 
to treat X^ngy^ '5p(") logdim asymptotically, Mat- 

sumoto and Hayashi [l^ introduced the quantity ^ := 

ni<.n2l--na< ' ^''^ their Appendix D, they shown that 

n! 

V Qp{ri) (log dim Vn - log — ) 
^ — ' n! 



E 



■log- 



-'^ogYlip, ~ pj 



sgn(s)n»pf'" 
ses, n,;<,(P»-P.) 



(10) 
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where 5i 



i and we applied the formula we treat the mini-max problem concerning the constant 



E.GSrfSgn(s)ni?3/'" = n»<j(K -Pj)- In their Ap- 
pendix C, they calculated X^sgy"* Qp{^) 



term, which is the second order term. The following is 
the main theorem. 



5Z <9p(^)iog|y 

=H{p)n — log n — log 27re - - log 



Pi 



(11) 



Theorem 1 We obtain the following mini-max value: 



The combination of dS]), and ^ yields that 

J2 P(^)(log |i^„^| + logdimWs + log dim Vs) 



£ -1 

minsup lim ( _D(p'^"||fT„) logn 



=H{p)n + log]^(pj - 
d-1 



--Cd + log 



yd 



dp, 



(12) 



1 1 „ 1 
log TT- log 2i'e - ^ 2^ log Pi 



log 



1)! 



2 
log 



2rf-i3<i-2...(d_ 1)1 



=H{p)n 



1 , 1 , „ 

— log n log 2ne 



log 2 



d-lod-2 



■ ■■{d-iy 



logd!(d-l)! + 21og[](p,-pj) 



t<3 



E 

sGSd 



Sgn(g)^^P^'" 



logH^^''"' - ^^E^og^'^ 



+ ^--^ logn + - logd!(d - 1)! + ^(p), 



where 



d-1 



log27re - log2''"^3' 



-lQrf-2 



(d-1)^ 



2 log ~ Pi) ~ ^ E ^°g^ 



Hence, 



d2-l 



= -Trp®"logfH/,„-ni/(p) 
logn + Cd- log d!(d-l)! + C(p). 



-where := {p = (pi,p2, ■ • • 1 - Pi - . . . - 

Pd-i)bi > P2 > ■•• > Pd-i > 1 - pi - ... - 
Pd-i > 0}, and dp dpidp2 ■ ■ -Pd-i- The above mini- 
max value is realized when we choose the mixture state 
'^J,n '■— X^nGy Jn{n)Pn with the distribution Jn{n) :~ 



This mini-max value is also attained 



by the mixture state aj^n '■= Jyd p{p)^^Jip)dp, where 
p{p)®" is the mixture concerning the invariant measure 
M on SV{d), I.e., J^^^^^{Up{p)W)^"p{dU). 



Since the Jeffreys' prior gives the mini-max solution in 
the classical case, the distribution J{p) on can be 
regarded as a quantum extension of Jeffreys' prior. 

Since the state p^" is invariant for permutation, 
-D(/3^"||cr„) = -Tr[p«^"log(T„] - nH{p) is equal to 
- Tr[p»" log V;cr„V;t] - nH{p) for any s G Sn- Thus, the 
operator convexity of the function x — log x implies 



-Tr[p«"loga„]-ni/(p) 
= E ^Tr[p®"(-logKa„K^)] -ni/(p) 

> - Tr[p«"log J2 T^^VsTnV:] - nH{p). 



ses„ 



III. MINI-MAX PROBLEM 

From the discussion of the above section, it is pos- 
sible to reduce the asymptotic approximation error 
D{p^"\\(T(j^ri) to 2^ logn universally. In this section, 



Since the state X^sss ji~\^s<^nV^ is invariant for the 
action of 5„ , we can restrict our states (T„ in (I12p to per- 
mutation invariant states. Due to the unitary covariance 
and the operator convexity of the function x — log a;. 
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the invariant measure /i on SU(rf) satisfies that 



sup hm ( Z?(p«"||a„) - ^1-ilogn 



sup lim sup i:'(p(p)®"||C/®"cr„(t/®")t) 



/yd Qj,n{p,n)dp) approaches J„(n) because the variable 
^ approaches p in probabiUty Qp[3 and J{p) is contin- 
uous. The variable p approaches ^ in probability under 
the conditional distribution Qj^nipl^)- Then, 



,^ n — ^oo 



UeSU{d) 



■ log 



= sup sup lim -Trp(j?)®"logf/®"cr„(C/®")1' 

p (76SU(d) 

£ -1 

- nH{p) — logn 

>sup lim - / Trp(p)®"logC/®"a„(C/®")V(rft/) 

p JSU(d) 

- nH{p) — logn 

>sup lim -Tr/3(p)®"log ( / 
p \Jsv{d) 



[/®"(T„([/®")V(rft^) 



nH{p) — logn. 



Since the state ( /gu(rf) ?7®"cr„(J7®") V(rfC^)) is invariant 
for the action of SU(d), we can restrict our states (t„ in 



T2| to states invariant for SU(c?). That is, we obtain 



logn 



■logn , 



inf sup lim 
: inf sup lim I D{p'^''\\ap 



where P„ is the probability measure on and trp^ 
Lrier" Pn{n)Pn- Siucc 



yd 



we have 



^(P(p)®"lkp„,n) 



P (fl) 

logn + Cd - logd!(d - 1)! + Cip) - log 
logn + Cd - \ogd\{d - 1)! + log( / e^^P'Up') 



+ log J{p)^Y. (") log ^" (") dUd-l)\ 

n 

= ^-^ log n + Crf + log / e'^f'^'dp' 

+ I] Qp(^)(log J(p) - log P„(n) - (d - 1) logn). 

n 

(13) 

Consider the joint distribution Qj,n{p,n) := 
J{p)Qf{n). The marginal distribution Qj,n{n){= 



Y<i Jya Jn{p)Pp{n)dp 



dp 



= log J - = log J„ n +log— 7 can^-' 

n jyac^yp'dp 

= log J„(n) + {d- l)logn. 

Only the second group depends on p in (|13p . The supre- 
mum of the second group of (jl3p concerning p is evaluated 
as follows. 

sup Qp{n) (log J{p)- log P„ (n) - (d - 1) log n) 

> / V Op-(n)(log Jip) - logP„(n) - (d - 1) logn) J(p)dp- 
= ^ (^) (log J„ (n) - log Pn (n) ) 

n 

=i^(J«||P„)>0. 
Hence, 



lim VP„(n)(I?(p(-)«^"||ap„,„ 



d^-1 



logn) 



>Cd + log/ eC'(P^dp. 

/yd 



The equality holds when Pnin) = J„(n). Therefore, 



minsup hm ( L'((0'^"||(t„) — logn 



=Cd + log / e^^f^dp. 



Note that the existence of the minimum value is 
proven here. Since the state p{p)®'^ has the 
form J2neY'^ Qpi''^)Pfi^ state aj^n has the form 

J2neY^Qj,nin)pn- Bccause Qj^nin) approaches J„(n), 
the state (Tj_„ also attains the mini-max in p2p . 



IV. QUBIT CASE 

In the qubit case, C2 and C{p) are calculated as fol- 
lows. 

C2 = -ilog27re-log2 
C[p) =21og(pi -P2) — — logpi 

Pi ~P2 
r>2 1 

H logP2 - 7:(logPi + logp2)- 

P1-P2 2 
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Then, the term log e'^^P^dp is numerically calculated 
to -0.50737. We obtain 

minsup lim (L>(p^"|lcr„) - ^logn) = -3.5545. 

On the other hand, Krattenthaler and Slater f3| focus 
on one parameter family {01(^)1 — oo < u < 1}, whose 
elements are given as the mixtures of n-copy state for 
invariant measures Qu, which given in (1.7) of Q. They 
numerically calculated that 

minsup lim {D{p^"\\Cn{u)) - - \ogn) ^ -2.3956. 

ti pes n^oo 2 

Note that they derived the above value —1.66050 with 
the natural logarithmic base. 

This result does not contradict our result because we 
take the minimum concerning all sequence of states {cr„} 
while they take it concerning specific sequences of states 
{C„(m)}. The difference -2.3956 - (-3.5545) = 1.1589 
expresses the improvement for choosing the optimal ap- 
proximating state. 

V. MINIMIZATION OF THE AVERAGE 
ENERGY 

In this section, we apply obtained result to quantum 
variable- length lossless coding. First, we give a formula- 
tion of variable-length lossless coding. For this purpose, 
we focus on the Fock space: 

We := 0(C2)®^ (14) 

k=0 

When we use this space for storing quantum information, 
we cannot determine the length of stored state without 
state demolition. 

Hence the variable-length lossless coding is formulated 
as follows. When we compress the quantum state on the 
n-fold tensor product system of 'H{= C^), i.e., Ti®", the 
encoder is given by the isometry [/„ from 7^"^" to Hq, 
and the decoder is given as the inverse map from the 
image C/„ to H®". 

Note that our definition is more general than Bostrom 
and Felbinger [ll[ because they assume that the com- 
pressed state of the basis state belongs to the space 
(C^)®", i.e., is not a superposition of different-length 
states. Further, when we store quantum state by the 
Fock state, we do not have to know the length of the 
stored state. Alternatively, the average energy is an im- 
portant quantity for physical realization because a higher 
energy damages the communication channel and the stor- 
age. Note that the energy is not necessarily to be mea- 
sured. In this case, it is natural to treat the Hamiltonian 
H := X^fcLo ^-f'fci where Pk is the projection to the space 
(C^)®'°. In this case, when the initial state is given by 
p, the average energy is given as Tt HUpW . Hence, it 



is suitable to consider the minimization of the average 
energy for a given ensemble {{pi,Pi)} on H**". 

For any lossless code J7„ on 7i®", we can choose the 
density cr(C/„) on H®" such that 

-Trplogcr(C/„) - logr7ilogd] < Ti HUpU'' (15) 

for any state p on Ti®". Since the relation 
-TrpplogfT([/„) - H{pp) = D{pp\\a{Un)) holds for the 
mixture of the ensemble pp := '^^PiPi, the average en- 
ergy HU ppU'^ is greater than H{pp) — log2(nlog2 d). 

Next, we consider the small class of quantum lossless 
codes. When we concatenate two general quantum loss- 
less codes, the concatenated lossless code is not neces- 
sarily determined. In order to avoid this problem, we 
consider the prefix quantum lossless code. A code C/„ on 
Ti®" is called prefix when there exists a basis {|ei)}^^j^ 
on Ti®" (the base \ei) does not necessarily have the ten- 
sor product form.) such that the state C/„|ei) has the 
form \(f>i{i) ■ ■ ■ (f>k{i)) € (C^)®*^, and the classical code 
(f){i) := {4>i{i) ■ ■ ■ (j)k(i)) satisfies the prefix condition. 
Since the classical concatenated code of two classical pre- 
fix codes can be defined, the concatenated quantum loss- 
less code of two prefix quantum lossless codes can be de- 
fined. Hence, it is natural to restrict our quantum lossless 
codes to prefix lossless codes. 

As is known in information theory [6], any classical 
prefix code (j) satisfies Kraft inequality 

^2-l'^WI < 1, 

i 

where \4>{i) \ is the coding length. For any prefix quantum 
lossless code [/„ , we choose a classical prefix code </) such 
that Un\&i) — |0(*))- Then, a quantum version of Kraft 
inequality 

Tr^2-<^l^"^^"l^>|e,)(e,| = ^2-l^«l < 1 (16) 

i i 

holds 

Conversely, as is known in information theory for 
any probability distribution pi , there exists a prefix clas- 
sical code (j> such that 

-logp, < |(/.(i)| < -logp, + l. (17) 

Hence, for a given state a on Ti®", we focus on its diag- 
onalization a = '^iPi\i){i\- Then, we choose the prefix 
classical code satisfying ([TT]). Then, we can define the 
prefix quantum lossless code [/„ by 

The prefix quantum lossless code J7„ satisfies that 

-Trplogcr < Ti-HUpU"^ < - Tr p log ct -f- 1 

for any state p on Ti®". That is, a prefix quantum loss- 
less code almost corresponds to a quantum state. There- 
fore, we can identify a prefix quantum lossless code with 
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the corresponding quantum state. Since the relation 
-Trpplogcr(;7„) - ff(CTp) = D{pp\\a{Un)) holds for the 
mixture of the ensemble pp, the minimum of the average 
energy Tr HU ppU'^ among prefix quantum lossless codes 
is almost equal to the von Neumann entropy H{pp). Fur- 
ther, the inequality (fTSj) guarantees that if we remove the 
prefix condition, it is impossible to improve the average 
energy so much. 

When we use a prefix quantum lossless code corre- 
sponding to a state a and the true mixture state is Pp, 
the average energy is 

- Trpp log (7 = H{pj,) + D{p.p\\a). 

Therefore, the relative entropy D{pp\\a) can be regarded 
as the redundancy of the quantum lossless code a with 
respect to pp. 

When the true mixture is p®" and the prefix quan- 
tum lossless code crp„,n defined in Section [llll is applied, 
the asymptotic redundancy is given by (|13p . Hence, the 

,2_i 

mini-max asymptotic redundancy is ^ log n + Cd + 
log Jyd e'"^^ dp, which is attained when we choose the 
mini-max code (Tj^„ or d'j^n- Therefore, the prefix quan- 
tum lossless code crj_„ and aj^n can be used as universal 
quantum lossless data compression. 

VI. DISCUSSION 

We have found a sequence of states {cr„} such that the 
relative entropy I?(p®"||cr„) behaves as ~ ^ log n uni- 
versally. While this result known only in qubit case, the 
general case had been open for more than ten years. In 
this derivation, Matsumoto and Hayashi [l^'s calcula- 
tion plays an essential role. Further, we have solved the 
asymptotic mini-max problem concerning £'(p®"||f7„) — 

2 log"-- It has been checked that our optimal value 
better than Krattenthaler and Slater [4]'s result in the 
qubit case. Our discussion is different from the original 
Clarke and Barron[ll [^'s discussion because they con- 
sider the optimization among mixtures of n i.i.d. distri- 
butions. Our method can be translated to the classical 
case when the family is given as the full multinomial dis- 
tribution family. In such a case, the derivation of mini- 
max problem can be expected to be shorter than the 
original derivation fl], f^. 

In addition, we have revisited quantum variable-length 
lossless data compression, and characterized the average 
energy (length) by (fT5|) for the general case. This result 
has a similar form as Theorem 5 in , but is a little 
different from their result. In addition, our definition of 
quantum lossless code is more general. Then, our result 
is stronger. While they used the method of majorization, 
we do not use it. 

Concerning the prefix case, we have derived the corre- 
spondence between the prefix quantum lossless code and 
a density matrix through a quantum version of Kraft in- 
equality pop . Using this relation, we have applied our 



result ([2]) and (|12p to quantum variable-length lossless 
data compression. It has been clarified that it is possible 
to compress multi-copy state universally by lossless code 
CTj,n or aj.n. 

Further, the relation between obtained approximation 
and other universal protocols are not discussed in this pa- 
per. This relation will be treated in a future paper [18|. 
Obtained universal approximation may be applied to 
other topic in quantum information [l9|. Hence, it is 
interesting to discuss this application. 
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APPENDIX A: PROOF OF (flHj) 

Let Pk and be the projections to the space and 
the image of the code C/„, respectively. We denote the 
projection to the range of operator X by P{X). Now, we 
define the projection Ak := A^-i — P{Ak-iPkAk-i) in- 
ductively. Since the projection P(Ak~iPkAk-i) is com- 
mutative with Ak-i, Ak is a projection. Since the rank 
of A is finite, there exists a number K such that = 
for k > K. Thus, ^0 = ©f^iP(Afe_iPfcylfc_i). Define 
the unitary V and the projection B 

P{Ak-iPkAk^i) ^VPiPkAk-iPk)V^ 

B:=(bLi P{PkAk-iPk)- 

Then, ~ VBV'^. Since all non-zero eigen- 

values of P{Ak-iPkAk^i)HP{Au-iPkAu-i) are 
greater than fc, P{Ak-iPkAk-i)HP{Ak-iPkAk-i) > 
VP{PkAk-iPk)HPiPkAk-iPk)Vl Thus, AoHAo > 
VBHBV^. 

Now, we construct a state cr(J7„). We choose a ba- 
sis on the range of B such that \fi){fi\ is com- 
mutative with all of P{PkAk-iPk)- Using the ba- 
sis jej := UXV\fi), we define the matrix (7([/„) := 
2^ '^^^^^^f^'i \ei) {ei\. The average length satisfies that 

Tr UnpUlH = Tr Ur^pUiA^HA^ 
>Tr UnpUlVBHBV^ = - TrplogCT(C/„) 

for any state p. This inequality guarantees that it is 
sufhcient to treat the code V^Un instead of [/„. 

Next, we calculate Tr(5-(J7„). we reorder je^) with the 
order increasing {f^\H\fi). When M = 2^ + 2"^ + 2^ + 



+ 2'"(> 2"), we have 



M 



Thus, 



2=1 



M' 



i=l 



for any M. Since dimT^®" = d", 

Tra([/„) < [nlogd]. 
Defining the state a{Un) := TrCT(j7 we obtain 

Tr C/„pC/ti/ > - Tr p log a(C/„) - logfn log d\ 
for any state p. 
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